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Genetic  factors  appear  to  be  highly  relevant  to  predicting  dif¬ 
ferential  risk  for  the  development  of  post-traumatic  stress 
disorder  (PTSD).  In  a  discovery  sample,  we  conducted  a  ge¬ 
nome-wide  association  study  (GWAS)  for  PTSD  using  a  small 
military  cohort  (Systems  Biology  PTSD  Biomarkers  Consor¬ 
tium;  SBPBC,  N  =  147)  that  was  designed  as  a  case-controlled 
sample  of  highly  exposed,  recently  returning  veterans  with  and 
without  combat-related  PTSD.  A  genome-wide  significant  single 
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nucleotide  polymorphism  (SNP),  rs717947,  at  chromosome 
4pl5  (N  =  147,  (i  =  31.34,  P=  1.28  X  10“®)  was  found  to  associ¬ 
ate  with  the  gold-standard  diagnostic  measure  for  PTSD  (the 
Clinician  Administered  PTSD  Scale).  We  conducted  replication 
and  follow-up  studies  in  an  external  sample,  a  larger  urban 
community  cohort  (Grady  Trauma  Project,  GTP,  N  =  2006),  to 
determine  the  robustness  and  putative  functionality  of  this  risk 
variant.  In  the  GTP  replication  sample,  SNP  rs7 17947  associated 
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with  PTSD  diagnosis  in  females  (N  =  2006,  P=  0.005),  but 
not  males.  SNP  rs717947  was  also  found  to  be  a  methylation 
quantitative  trait  locus  (meQTL)  in  the  GTP  replication 
sample  (N  =  157,  P=  0.002).  Further,  the  risk  allele  of 
rs7 17947  was  associated  with  decreased  medial  and  dorsolateral 
cortical  activation  to  fearful  faces  (N  =  53,  P  <  0.05)  in  the 
GTP  replication  sample.  These  data  identify  a  genome-wide 
significant  polymorphism  conferring  risk  for  PTSD,  which 
was  associated  wdth  differential  epigenetic  regulation  and 
with  differential  cortical  responses  to  fear  in  a  replication 
sample.  These  results  may  provide  new  insight  into  understand¬ 
ing  genetic  and  epigenetic  regulation  of  PTSD  and  intermediate 
phenotypes  that  contribute  to  this  disorder. 

©  2015  Wiley  Periodicals,  Inc. 
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INTRODUCTION 

Although  post-traumatic  stress  disorder  (PTSD)  requires  the 
experience  of  a  life-threatening  trauma  to  occur,  twin  and  family 
studies  have  suggested  that  up  to  30-40%  of  the  risk  for  PTSD 
following  significant  trauma  is  genetically  heritable  (e.g.,  [True 
et  al,  1993] ).  Genetic  studies  to  date  have  been  relatively  small,  but 
have  yielded  approximately  15-20  different  genes  that  might 
associate  with  PTSD  or  PTSD  risk.  These  candidate  studies  are 
weU-discussed  in  several  recent  reviews  [Pitman  et  al.,  2012;  Almli 
et  al.,  2014b].  There  have  been  three  published  GWAS,  at  the  time 
of  this  writing,  focused  on  PTSD  [Guffanti  et  al.,  2013;  Logue  et  al., 
2013;  Xie  et  al.,  2013].  The  genome-wide  significant  findings  have 
yielded  variants  within  or  near  potentially  interesting  candidate 
genes,  including  RORA,  TLL,  and  COBL,  as  well  as  within  a  long 
noncoding  RNA,  lincRNA  AC0687 1 8. 1 .  The  advantage  of  GWAS  is 
that  it  is  hypothesis  neutral,  and  thus  allows  the  identihcation  of 
variants  not  selected  for  candidate  gene/single  nucleotide  poly¬ 
morphism  (SNP)-based  analysis.  However,  there  is  a  stringent 
multiple  test  burden  with  GWAS,  which  is  necessary  to  avoid 
attributing  the  finding  to  false  discovery. 

Since  it  is  difficult  to  get  very  large  sample  sizes  within  a  single 
cohort,  and  individual  heterogeneity  is  introduced  by  them, 
another  approach  is  to  conduct  GWAS  in  highly  homogenous, 
well-characterized  samples  where  exposure  levels  are  controlled  in 
both  cases  and  controls.  For  instance,  a  study  using  very  well- 
matched  trauma  control  cohorts  may  provide  statistical  power 
that  is  not  found  in  epidemiological  samples  of  patient  cases  com¬ 
pared  to  non-patient  controls.  Alternatively,  an  extreme  phenotype 
design,  where  there  is  enrichment  of  extreme  cases  and  controls,  can 
lead  to  increased  odds  ratios,  and  improved  power  to  detect  asso¬ 
ciations,  especially  with  quantitative  traits  [Van  Gestel  et  al,  2000]. 
An  ideal  solution  from  these  methods  would  be  a  discovery  sample  in 
which  the  cases  differ  in  having  extremely  severe  PTSD  compared  to 
those  similarly  exposed  to  trauma,  but  with  no  PTSD  at  aU. 

In  this  study,  we  aimed  to  increase  our  understanding  of  genetic 
risk  for  PTSD  by  conducting  a  GWAS  for  PTSD  in  a  discovery 
sample  with  an  extreme  phenotype  design  and  cases  and  controls 
having  similar  exposures.  Our  discovery  sample  consisted  of  a 
deeply  phenotyped,  weU-controlled,  similarly  exposed,  but  rela¬ 


tively  small  military  sample  (Systems  Biology  PTSD  Biomarkers 
Consortium;  SBPBC)  [Yan  et  al.,  2013;  Lindqvist  et  al.,  2014; 
Yehuda  et  al.,  2015].  To  validate  GWAS  findings,  we  conducted 
a  replication  study  in  a  sample  consisting  of  subjects  from  an 
independent  cohort,  the  civilian  Grady  Trauma  Project  (GTP) 
[Binder  et  al.,  2008;  Almli  et  al.,  2013].  The  GTP  is  a  large, 
primarily  African-American  highly-traumatized  community  co¬ 
hort  recruited  from  the  general  medical  clinics  of  Grady  Memorial 
Hospital,  a  publicly  funded  hospital  that  serves  economically- 
disadvantaged  individuals  in  Atlanta,  Georgia.  Using  the  replica¬ 
tion  cohort  (GTP),  we  are  able  to  expand  our  understanding  of  the 
GWAS  findings  through  functional  genomic  and  neuroimaging 
association  analyses.  Identification  and  replication  of  PTSD  asso¬ 
ciation  across  such  divergent  cohorts,  combined  with  examination 
of  GWAS  findings  at  epigenetic  and  neural  intermediate  phenotype 
levels,  has  led  to  a  novel  and  interesting  potential  PTSD-associated 
genomic  locus. 

METHODS 

Participants  in  Discovery  Cohort 

The  SBPBC  cohort  utilized  for  initial  GWAS  discovery  in  this 
report  was  recruited  as  part  of  a  larger  study  that  is  designed  to 
identify  biomarkers  for  PTSD  diagnosis  in  cross-sectional  and 
longitudinal  studies  in  male  and  female  Iraq  and  Afghanistan 
veterans  [Yan  et  al.,  2013;  Lindqvist  et  al.,  2014;  Yehuda  et  al., 
2015].  Participants  were  recruited  at  New  York  University  Langone 
Medical  Center  and  Icahn  School  of  Medicine  at  Mount  Sinai  and 
from  New  York  City  Veteran  Affairs  medical  centers  at  Manhattan 
and  Bronx,  as  weU  as  other  veteran  service  organizations  and  the 
community.  Ah  procedures  were  approved  by  the  Institute  Review 
Boards  of  NYU  School  of  Medicine  and  Mount  Sinai  School  of 
Medicine.  Participants  gave  written  informed  consent  after  receiv¬ 
ing  a  complete  description  of  the  study.  General  inclusion  criteria 
include  being  a  US  veteran  who  served  in  Operation  Enduring 
Freedom  in  Afghanistan  and/or  Operation  Iraqi  Freedom,  between 
the  age  of  20  and  60  years,  being  able  to  understand  the  protocol 
and  wihing  to  provide  written  informed  consent.  Consistent  with 
the  goal  of  having  extreme  phenotypes,  with  PTSD  being  the 
primary  difference  between  groups,  exclusion  criteria  included 
loss  of  consciousness  for  more  than  10  min,  neurological  disorders, 
current  alcohol  and  substance  abuse  disorders,  suicidality,  lifetime 
history  of  any  psychiatric  disorder  with  psychotic  features,  bipolar 
disorder  or  obsessive-compulsive  disorder. 

Doctoral  level  clinical  psychologists  conducted  structured  diag¬ 
nostic  interviews.  Criteria  for  the  PTSD  group  include  warzone 
exposure  and  related  PTSD  symptoms  of  at  least  three  months 
duration  as  indexed  by  the  clinician  administered  PTSD  scale 
(CAPS)  [Blake  et  al.,  1995],  consisting  of  the  sum  of  symptom 
frequency  and  intensity.  The  criteria  for  control  group  also  includ¬ 
ed  warzone  exposure,  but  subjects  did  not  meet  CAPS  criteria  for 
lifetime  or  current  combat  or  civilian  PTSD.  Warzone  exposure 
was  defrned  as  exposure  to  an  OEF/OIF  (GWOT)  Criterion  A  event 
and  was  confirmed  by  clinical  interview.  The  Structured  Clinical 
Interview  for  DSM-IV  Diagnosis  [Ventura  et  al.,  1998]  was  used  to 
diagnose  comorbid  disorders  and  to  assess  for  exclusion  criteria. 
The  military  version  of  the  PTSD  Checklist  (PCL-M)  [Weathers 
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et  al,  2013] ,  and  the  Beck  depression  index  (BDI)  [Beck  et  al,  1961] 
were  also  used  as  self-report  questionnaires  to  assess  symptom 
severity.  After  clinical  screening,  subjects  who  met  the  diagnosis 
criteria  for  PTSD  with  CAPS  score  >40  were  included  in  the  PTSD 
group  and  those  with  CAPS  score  <20  were  included  in  the  control 
group.  These  cut-offs  were  chosen  to  ensure  clinically  significant 
levels  of  symptomatology  in  the  cases,  and  to  exclude  sub-threshold 
cases  in  the  control  group.  Because  of  the  nature  ofwarzone  trauma 
and  the  recruitment  feasibility  of  combat  veterans,  PTSD  subjects 
in  the  present  study  were  chronic  cases,  and  both  groups  had 
undergone  prolonged  warzone  exposure.  Demographic  and  psy¬ 
chiatric  characteristics  of  the  discovery  sample  are  shown  in  Table  1. 
We  used  t-tests  to  determine  statistical  differences  between  cases/ 
controls  with  the  quantitative  data  and  chi-squared  tests  for 
categorical  data. 

Genetic  Data:  Quality  Control  and  Analyses 

Genotyping.  Using  DNA  extracted  from  blood  samples,  ge¬ 
nome-wide  SNP  genotyping  of  147  subjects  (we  note  that  one 


individual  withdrew  from  the  study)  from  the  SBPBC  discovery 
cohort  was  conducted  using  lllumina’s  HumanOmniExpress 
BeadChip.  After  calls  were  made  with  GenomeStudio  (lUumina 
Inc),  there  were  730,493  individual  SNPs  with  genotype  informa¬ 
tion.  We  performed  quality  control  on  the  SNP  data  using  PLINK 
[Purcell  et  al.,  2007]  (Supplementary  Fig.  1).  We  first  used  crude 
quality  control  filters  prior  to  performing  principal-component 
analysis  (PCA)  to  infer  axes  of  ancestry  (Supplementary  Fig.  2A). 
We  removed  3666  SNPs  for  low  caU  rates  (less  than  95%)  and 
16,786  SNPs  with  a  frequency  of  less  than  0.01.  AH  individuals 
had  call  rates  of  98%  or  greater.  Prior  to  PCA,  we  used  PLINK  to 
prune  the  autosome  only  data  in  windows  of  50  base  pairs, 
removing  one  SNP  from  each  pair  of  SNPs  with  r2>0.05  to  obtain 
a  set  of  roughly  independent  SNPs.  We  further  filtered  SNPs 
for  analysis  with  call  rates  of  less  than  or  equal  to  98%,  minor 
allele  frequencies  (MAE)  of  less  than  0.1,  and  Hardy-Weinberg 
equilibrium  (HWE)  P-values  of  less  than  1  x  10“®  (removing 
13,935,  137,997,  and  686  more  SNPs,  respectively).  After  quality 
control,  there  were  147  individuals  (130  males  and  17  females)  that 
were  genotyped  on  557,423  SNPs. 


TABLE  I.  Demographic  and  Clinical  Characteristics  of  the  Discovery  Sample  (SBPBC  Cohort]  Shown  as  Means  (SD)  or  Counts  [Percent] 


Total  (N  =  14?) 

PTSD  -  (N  =  G3] 

PTSD  (N  =  84] 

Age 

32.8  [8.0] 

33.6  (8.1) 

32.1  (8.0) 

Gender  (%  male] 

130  88.4% 

52  90.5% 

23  86.9% 

Race/ethnicity 

Hispanic 

5?  [38.8%) 

32  (50.8%) 

25  (29.8%) 

Non-hispanic  Asian 

6  [4.1%) 

0 

6  (2.1%) 

Non-hispanic  black 

35  [23.8%) 

16  (25.4%) 

19  (22.6%) 

Non-hispanic  white 

45  (30.6%) 

14  [22.2%] 

31  [36.9%] 

Non-hispanic  other 

4  [2.2%) 

1  (1.6%) 

3  (3.6%) 

Education 

Less  than  12th  grade 

3  [2.0%) 

1  (1.6%) 

2  (2.4%) 

High  school  grad  or  GED 

39  [26.5%) 

21  (33.3%) 

18  (21.4%) 

2  years  college,  AA  degree 

4?  [32.0%) 

22  (34.9%) 

25  (29.8%) 

4  years  college,  BA  degree 

43  (29.3%) 

12  [22.0%] 

26  [31.0%] 

Masters  degree 

14  (9.5%) 

2  (3.2%) 

12  (14.3%) 

Doctoral  degree 

1  (0.2%) 

0  [0%] 

1  (1.2%) 

Relationship  status 

Single 

59  (40.1%) 

22  (34.9%) 

32  (44.0%) 

Steady  relationship  or  living  together 

23  (15.6%) 

11  (12.5%) 

12  (14.3%) 

Married 

39  (26.5%) 

16  [25.4%] 

23  [22.4%] 

Divorced 

26  (12.1%) 

14  (22.2%) 

12  (14.3%) 

Early  trauma  exposure^' 

5.8  [4.9] 

6.9  (5.2) 

5.0  (4.2) 

PTSD  severity  [clinician-rated]'’’ 

31.2  (34.2) 

69.3  (12.0) 

3.5  [5.2] 

PTSD  severity  [self-reported]”' 

41.0  (20.3) 

60.9  (12.5) 

26.0  [8.9] 

Depression  severity'*' 

13.8  (13.0) 

24.2  (11.4) 

5.6  [6.3] 

Military  experience” 

36.1  (14.2) 

52.2  [12.3) 

30.2  [9.6] 

Negative  life  events*' 

10.4  (10.9) 

12.8  (12.0) 

5.1  [5.9] 

AA,  associate's  degree;  BA,  bachelor’s  degree;  GED,  general  educational  development;  PTSD,  post-traumatic  stress  disorder. 
^ETISR  total  [Bremner  et  al.,  2000]. 

^CAPS  [Blake  et  al.,  1995]. 

'^PCL-M  [Weathers  et  al.,  2013]. 

'^BDI  [Beck  et  al.,  1961]. 

^DRRI  D  [Vogt  et  al.,  2013]. 

^Life  events  scale  negative-past  12  months  [Sarason  et  al.,  1928]. 

><0.05. 
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Statistical  analyses  for  GWAS.  Using  the  statistical  package 
PLINK,  we  regressed  the  outcome  of  current  CAPS  score  on  allele 
count  assuming  an  additive  model  (0, 1  >  or  2  copies  of  the  risk  allele) , 
including  sex  and  the  top  three  principal  components  of  genome¬ 
wide  data  as  co variates.  We  note  that  we  used  the  first  three  principal 
components  within  the  discovery  sample  to  prevent  possible 
model  overfitting  and  loss  of  power.  To  verify  the  robustness  of 
the  top  SNP,  we  used  logistic  regression  to  test  for  the  association 
between  PTSD  diagnosis  (using  CAPS)  and  allele  count,  including 
the  same  covariates  as  above.  Given  the  mixed  ancestry  of  the 
discovery  samples,  we  also  tested  significant  variants  using  self- 
reported  race  instead  of  principal  components,  as  a  sensitivity 
analysis. 

Replication  Cohort 

Participants.  The  replication  cohort  consisted  of  study  par¬ 
ticipants  from  Grady  Memorial  Hospital  (Atlanta,  Georgia)  as  part 
of  the  GTP.  As  previously  shown  [Ressler  et  al.,  2011],  these 
participants  were  adult,  primarily  female,  highly  traumatized 
and  impoverished,  with  high  rates  of  PTSD.  To  minimize  genetic 
differences  due  to  ancestry,  individuals  within  three  standard 
deviations  of  the  medians  of  the  first  and  second  principal  com¬ 
ponents  of  self-reported  African  Americans,  were  selected  for 
analysis  [Almli  et  al.,  2014a]. 

Genetic  association  analysis.  Given  that  the  discovery  sample 
was  primarily  male,  and  prior  findings  of  genomic  differences  in 
PTSD  across  the  sexes  [Ressler  et  al.,  2011;  Gillespie  et  al.,  2013], 
we  conducted  an  analysis  stratified  by  sex.  Using  the  categorical 
measure  of  PTSD,  we  conducted  SNP -based  association  analysis 
in  the  GTP  samples  for  the  genome-wide  significant  SNP.  Analyses 
were  run  parallel  to  those  described  in  the  SBPBC  cohort.  In 
males  and  females  separately,  we  regressed  PTSD  diagnosis,  based 
on  DSM  A-D  criteria  using  responses  to  the  modified  PTSD 
Symptom  Scale  (PSS)  [Falsetti  et  al.,  1993],  on  aUele  count 
(0,  1,  or  2  risk  alleles)  as  above.  Due  to  the  larger  sample  size, 
we  used  the  top  10  principal  components  [Price  et  al.,  2006;  Lin  and 
Zhoa,  2009]  and  chip  type  (lUumina’s  HumanOmniExpress  or 
Omni  1- Quad  BeadChip)  as  covariates. 

Follow-Up  Analyses  in  Replication  Cohort 

Methylation  quantitative  trait  locus  analysis.  DNA  methyla- 
tion  was  assessed  as  previously  described  [Mehta  et  al.,  2013]. 
Briefly,  1  microgram  of  DNA  was  bisulfite-treated,  and  methyla¬ 
tion  at  >485,000  CpG  sites  was  interrogated  using  the  Human- 
Methylation450  BeadChip  (lUumina).  Beta  values  were  generated 
with  BeadStudio  and  were  set  to  missing  (no  call)  if  detection 
p-values  exceeded  .001.  CpGassoc  [Barfield  etal.,  2012]  was  used  to 
remove  samples  with  probe  detection  call  rates  <95%  and  those 
with  an  average  intensity  value  of  either  <50%  of  the  experiment¬ 
wide  sample  mean  or  <2,000  arbitrary  units  (AU).  In  addition, 
CpG  sites  with  missing  data  for  >10%  of  samples  were  excluded 
from  analysis.  Beta  Mixture  Quantile  dilation  (BMIQ)  was  used  to 
normalize  each  dataset  [Teschendorff  et  al.,  2013].  Individual  level 
data  from  the  1 1  CpG  sites  within  1  MB  of  rs717947  were  examined 
for  this  study.  For  each  CpG  site,  methylation  is  represented  as  the 


average  of  methylated  and  unmethylated  DNA  across  all  cells  that 
make  up  an  individual  DNA  sample. 

Given  that  rs7 17947  is  located  in  an  intergenic  region  that 
appears  to  be  epigenetically-regulated  based  on  human  postmor¬ 
tem  prefrontal  cortex  methylation  data  [Maunakea  et  al.,  2010], 
we  identified  methylation  quantitative  trait  loci  (meQTL)  in 
the  GTP  cohort  by  applying  the  approach  described  previously 
[Smith  et  al.,  2014]  to  the  HumanMethylation450  data.  Using  R 
(www.R-project.org),  the  relationship  between  the  proportion  of 
methylation  at  the  CpG  site  and  SNP  was  examined  via  linear 
regression,  where  methylation  was  modeled  as  a  linear  function  of 
the  number  of  reference  alleles  (0, 1,  or  2)  with  sex,  age  and  the  top 
three  principal  components  from  GWAS  as  covariates.  Thus,  an 
association,  if  found  between  genotype  and  methylation,  represents 
genotype-dependent  averages  across  the  entire  cohort.  We  also 
performed  the  same  analysis  in  females  only,  consistent  with  the 
strategy  described  for  SNP  replication  above.  We  examined  1 1  CpG 
sites  and  adjusted  for  each  as  an  independent  test;  thus,  P  <  0.0045 
(0.05/11  sites)  was  noted  as  statistically  significant. 

Neuroimaging.  To  determine  the  neural  correlates  of  the 
PTSD  risk  allele,  we  examined  fMRI  data  among  a  subset  of 
traumatized  women  drawn  from  the  GTP  replication  cohort. 
Participants  completed  a  task  designed  to  engage  threat-processing 
networks,  passively  viewing  static  fearful  and  neutral  face  stimuli. 
Fearful  and  neutral  face  stimuli  were  presented  in  a  block  design. 
Trials  included  a  face  stimulus  presented  for  500  ms,  followed  by  a 
500  ms  presentation  of  a  fixation  cross.  Subjects  were  instructed  to 
pay  attention  to  the  faces,  and  did  not  make  any  behavioral 
response,  to  minimize  motion  artifacts  and  neural  activation 
unrelated  to  processing  the  visual  stimulus.  Detailed  procedures 
and  data  processing  methods  are  described  elsewhere  [Stevens 
et  al.,  2013]. 

RESULTS 

The  demographic  characteristics  of  the  SBPBC  subjects  are 
shown  in  Table  I.  The  SBPBC  cohort  had  an  overall  mean  of 
31.7  (SD  =  34.7,  range  0-136)  for  PTSD  symptom  scores;  how¬ 
ever,  because  the  cohort  was  designed  with  extreme  phenotypes, 
symptom  severity  was  much  higher  in  cases  [PTSD-R:  mean 
=  69.4  (16.9)]  than  controls  [PTSD-:  3.5  (5.2)].  We  report 
that  the  demographic  characteristics  were  similar  between 
cases  and  controls  (P>  0.05).  The  following  psychiatric  variables. 
Early  Trauma  Exposure,  PTSD  severity  (clinician-rated),  PTSD 
severity  (self-reported).  Depression  severity,  and  Negative  Life 
Events,  showed  differences  between  cases  and  controls,  with  cases 
endorsing  higher  symptoms. 

GWAS 

Using  the  military  SBPBC  cohort,  we  conducted  a  GWAS  using  a 
quantitative  measure  of  PTSD  symptoms  as  the  outcome  with  sex, 
and  the  top  three  principal  components  as  covariates.  There  was  no 
evidence  to  suggest  inflation  of  test  statistics  in  our  GWAS  given  a 
lambda  of  1.00  (Supplementary  Fig.  2B).  The  GWAS  yielded  one 
genome-wide  significant  SNP,  rs717947,  at  chromosome  4pl5 
(N=  147,  (3  (SE)  =  31.34  (5.19),  P=  1.28  x  10“®)  that  associated 
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with  current  CAPS  score  (Fig.  lA).  SNP  rs717947  had  a  MAP  of 
0.18,  and  a  HWE  p-value  of  1  (4  TT/45  TC/98  CC)  in  the  whole 
sample  and  0. 1 8  in  controls.  Although  most  of  the  top  SNPs  for  the 
GWAS  were  intragenic,  with  the  exception  of  Collagen,  Type  IV, 
Alpha  2  {COL4A2),  several  loci  are  represented  4pl5.1,  12ql5, 
20pl2.1,  with  SNPs  in  likely  linkage  disequilibrium  (Table  II). 
Sensitivity  analysis  covarying  for  self-reported  race  instead  of  the 
top  three  principal  components  showed  similar  results  for  the 
association  between  rs717947  and  PTSD  symptoms  (data  not 
shown).  The  risk  allele  (T)  carriers  consistently  show  higher 
CAPS  scores  compared  to  individuals  with  the  CC  genotype 
[mean  (SD):  TT/TC  =  52.7  (34.7),  CC  =  21.5  (29.8)]  (Supplemen¬ 
tary  Fig.  3A).  Follow-up  analysis  of  SNP  rs7 17947  showed  that  this 


BC036345 


^APAP2 

OrHOt- 


— I - 1 - 1 - 1 - 1 - 1— 

31  32  33  34  35  36 

Position  on  chr4  (Mb) 


FIG.  1.  Chromosome  4  SNP  is  Associated  with  PTSD  at  a  Genome- 
Wide  Level.  (A)  Manhattan  plot  for  PTSD  symptoms  as  measured 
by  the  CAPS  in  the  SGPBC  cohort  (N  =  147]  showing  a  peak  on 
chromosome  4,  with  top  SNP  rs717947  (P  (SE)  =  31.34  (5.19), 
P=  1.28  X  10  °].  [B]  Regional  plot  of  chromosome  4  showing  a 
peak  of  SNPs  with  low  p-values  near  the  genome-wide  significant 
hit  (rs71794?  shown  as  a  blue  diamond]  and  non-coding  RNA 
BC036345  (chr4:  33,897,961-34,041,515].  Plot  generated  using 
LocusZoom  (http;//csg.sph.umich.edu/locuszoom/]. 


SNP  is  robust  to  different  measures  of  PTSD,  both  categorical  and 
self-reported  measures.  Using  the  categorical  PTSD  diagnosis 
measure  (based  on  CAPS)  as  an  outcome  revealed  an  odds  ratio 
(OR)  of  8.6  (N=  147,  95%  confidence  interval  (Cl)  =  3.5-21.6, 
P=  3.84  X  10“®)  (Fig.  2A).  Given  that  the  discovery  sample  was  an 
extreme  phenotype  design  based  on  GAPS,  we  verified  that  self- 
reported  PTSD  symptoms  (not  used  for  initial  sample  selection) 
produced  similar  results  (N  =  147,  (3  (SE)  =  17.37  (3.10),  P=  1.203 
X  10“^).  Plots  of  self-reported  PTSD  symptoms  are  similar  to 
symptoms  using  CAPS  (Supplementary  Fig.  3B). 

Given  uncertainty  on  whether  trauma  exposure  should  be 
considered  as  a  covariate  in  GWAS  of  PTSD,  we  next  examined 
whether  the  main  effect  of  rs7 17947  would  be  significant  for  CAPS 
scores  after  controlling  for  early  childhood  trauma  exposure  or 
overall  negative  life  events.  In  both  cases,  the  SNP  remained 
genome-wide  significant  [Covariate:  Early  Trauma  Inventory, 
N  =  146,  (3  (SE)  =  31.5  (5.2),  P=  1.13  x  10“®;  Negative  life  events, 
N=146,  (3  (SE)  =  24.9  (4.3),  P=3.29x  10“®]. 

Replication  of  rs?l?94? 

Using  the  civilian  GTP  cohort,  we  attempted  to  replicate  the 
association  between  SNP  rs717947  and  PTSD  diagnosis.  Due  to 
our  past  findings  of  SNPs  differentially  associating  with  PTSD  in 
males  and  females  (e.g.,  Ressler  et  ah,  2011)  in  addition  to  the  male 
majority  of  the  discovery  sample,  we  stratified  our  analyses  by  sex. 
In  females,  we  found  that  carriers  of  the  risk  allele  (T)  had  increased 
odds  of  PTSD  diagnosis  (N=2006,  OR=1.25,  CI=1. 1-1.5, 
p=0.005)  (Fig.  2B);  in  contrast,  no  association  was  found  in  males 
(N=862,  p=0.37).  Follow-up  analyses  showed  that  6  of  the  9  SNPs 
in  the  regional  peak  on  chromosome  4  identified  from  the  discov¬ 
ery  sample  also  associate  with  PTSD  diagnosis  in  the  same  direction 
in  the  GTP  replication  cohort  (nominal  p<0.1;  Supplementary 
Table  I).  We  also  note  that  in  the  GTP  sample,  this  SNP,  rs717947, 
was  not  associated  with  comorbid  disorders,  such  as  depression  as 
measured  by  BDI  (p=0.8),  alcohol  abuse  by  AUDIT  (p=0.4),  or 
drug  abuse  by  DAST  (p=0.6). 


rs?l?94?  is  a  Methylation  QTL 

Although  the  gene  structure  and  function  of  this  region  of  chro¬ 
mosome  4  is  unclear,  bioinformatics  analyses  suggest  that  this  peak 
of  association  is  within  an  intergenic  region  that  appears  to  be 
epigenetically-regulated  based  on  postmortem  prefrontal  cortex 
methylation  data  (Supplementary  Fig  4).  Therefore,  we  examined 
whether  the  GWAS-associated  SNP  is  potentially  a  functional  SNP, 
based  on  its  association  with  DNA  methylation  in  a  subset  of  the 
full  GTP  cohort  with  methylation  data.  Of  the  1 1  CpG  sites  within 
1  MB  of  the  SNP,  one  probe  was  found  to  be  significant  after 
multiple  test  correction  (Supplementary  Table  II).  The  number  of 
risk  alleles  of  rs717947  predicted  the  proportion  of  methylation 
from  probe  cg09242288  [N=157,  (3  (SE)  = -0.02  (0.007), 
P=  0.002]  in  the  overall  cohort  (Fig.  2G),  indicating  that  the 
SNP  is  an  meQTL.  Though  power  to  detect  associations  was 
reduced  when  the  analysis  was  restricted  to  females  (N  =  99), 
the  effect  sizes  was  equivalent  p  =  —0.02.  Methylation  of 
cg09242288  did  not  predict  PTSD  independent  of  genotype. 
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TABLE  II.  Top  10  Variants  From  GWAS  of  PTSO  Symptoms  as  Measured  by  the  CAPS 


Chr 

Gene 

Approx  distance  to  downstream  gene 

SNP 

BP  location 

Ailele 

P  (SE) 

P-value 

4 

intergenic 

2629  kb,  DTHDl 

rs212942 

33653252 

T 

31.34  [5.19] 

1.28  X  10 

4 

intergenic 

2422  kb,  DTHDl 

rsl3104923 

33810391 

6 

29.04  [5.23] 

1.35  X  10 

13 

C0L4A2 

intragenic 

rs4223155 

110921066 

A 

20.03  [3.92] 

1.35  X  10 

4 

intergenic 

2536  kb,  DTHDl 

rs2684329 

33242424 

A 

22.04  (5.38] 

1.42  X  10 

12 

intergenic 

423  kb,  RAPID 

rsl2104851 

68531344 

A 

-23.24  (4.83] 

2.42  X  10 

12 

intergenic 

458  kb,  RAPID 

rs3181035 

68546396 

A 

-23.24  (4.83] 

2.42  X  10 

20 

intergenic 

1069  kb,  PCSK2 

rs4239215 

16138249 

G 

18.00  (3.62] 

2.51  X  10 

20 

intergenic 

1065  kb,  PCSK2 

rs6080152 

16141991 

6 

-18.95  (3.90] 

3.14  X  10 

20 

intergenic 

1101  kb,  PCSK2 

rs6080131 

16106040 

G 

18.59  (3.85] 

3.48  X  10 

4 

intergenic 

2148  kb,  DTHDl 

rsl3113036 

34135538 

C 

23.19  (4.92] 

5.25  X  10 

SNPs  on  chromosome  4  are  likely  in  linkage  disequilibrium  [shown  in  Fig.  lb]. 


Neuroimaging  Correlates  of  rs?l?94? 

We  next  wished  to  examine  whether  this  SNP  may  be  associated 
with  an  intermediate  neural  phenotype  related  to  fear  or  threat, 
using  prior  data  from  fMRI  with  a  fearful  faces  task  in  a  subset  of 
the  civilian  GTP  cohort.  The  subset  of  females  with  fMRI 
data  did  not  appear  to  significantly  differ  on  the  clinical  or 
demographic  variables  from  the  full  genetic  sample  of  GTP 
females  (Supplementary  Table  III).  To  determine  brain  regions 
whose  response  to  threat  stimuli  was  influenced  by  genotype,  we 
conducted  a  whole-brain  analysis  of  the  response  to  fearful 
relative  to  neutral  pictures  (Fig.  3,  Supplementary  Table  IV). 
We  observed  a  linear  association  between  the  number  of  risk 
alleles  and  decreasing  activation  in  bilateral  dorsolateral  prefron¬ 
tal  cortex  (dlPFG)  and  dorsomedial  prefrontal  cortex  (dmPFC). 
These  regions  have  previously  been  shown  to  play  key  roles 
in  emotion  regulation  (e.g.,  Ochsner  et  ah,  2002),  and 
decreased  dlPFC  activation  has  been  linked  with  PTSD  symptoms 
[Aupperle  et  ah,  2012;  Fani  et  ah,  2012]. 

DISCUSSION 

Identified  in  a  small  military  cohort  designed  based  on  extreme 
phenotypes,  we  found  a  genome-wide  signihcant  SNP  that  asso¬ 
ciates  with  PTSD  that  replicates  in  a  much  larger  civilian  commu¬ 
nity  cohort.  In  the  discovery  sample  (SBPBC),  the  genome-wide 
significant  SNP,  rs717947,  at  chromosome  4pl5  associated  with 
PTSD  symptoms  from  the  CAPS,  as  well  as  the  PTSD  diagnostic 
measure  from  the  CAPS.  Subsequently,  we  replicated  the  top  SNP, 
as  well  as  several  others  within  the  association  peak,  with  PSS- 
determined  PTSD  diagnosis  from  the  much  larger  traumatized 
civilian  cohort.  Using  the  GTP  replication  cohort,  our  data  also 
suggest  that  rs7 17947  may  be  functionally  signihcant  as  the  SNP 
was  found  to  be  an  meQTL,  and  the  risk  allele  of  rs717947  was 
associated  with  altered  medial  and  dorsolateral  prefrontal  activa¬ 
tion  to  fearful  faces. 

These  hndings  are  notable  as  very  few  GWAS  of  PTSD  have  yet 
been  published,  and  of  those,  replication  has  been  limited. 
Furthermore,  functional  status  of  the  prior  GWAS  SNPs  has 
been  unclear  at  the  genomic  or  neural  levels.  Our  data  add  to  this 


literature  in  identifying  a  novel  candidate  region  in  chromosome 
4  that  appears  to  be  a  region  of  moderate  epigenetic  regulation. 
Using  human  tissue  derived  from  postmortem  brain,  a  research 
team  from  the  Department  of  Neurosurgery,  UC  San  Francisco, 
investigated  DNA  methylation  at  24  out  of  28  million  CpG  sites 
across  the  haploid  genome  and  found  that  a  majority  of  cytosine 
methylation  (5mC)  occurs  in  intronic  and  intergenic  regions 
[Maunakea  et  ah,  2010].  In  fact,  less  than  3%  of  GpG  islands 
located  in  5’  promoter  regions  are  methylated.  While  a  lot  of 
emphasis  has  historically  been  focused  on  methylation  at  the  5’ 
promoter  region  for  its  role  in  regulation  of  gene  expression, 
intergenic  DNA  methylation  may  also  play  a  key  role  in  gene 
regulation,  cell-type  specificity,  and  epigenetic  response  to  envi¬ 
ronment.  Li  et  al  have  recently  shown  that  distinct  populations  of 
neurons  have  differential  methylation  at  intergenic  loci  [Li  et  al., 
2014].  Furthermore,  they  suggest  that  methylation  differences 
within  these  intergenic  regions  may  be  a  result  of  experience- 
induced  methylation  that  contributes  to  the  development  of 
neuropsychiatric  disorders.  Interestingly,  in  silica  analysis  of 
the  genomic  region  surrounding  rs717947  (200  bp)  reveals  a 
strong  signal  for  DNA  methylation  (MeDIP)  within  the 
brain,  suggesting  that  this  intergenic  region  can  be  methylated 
(Supplementary  Fig.  4).  In  fact,  rs717947  may  also  be  part  of  a 
CpG  dinucleotide.  Given  this  scenario,  it  is  reasonable  to  assume 
that  if  this  SNP  is  indeed  a  CpG,  a  T  aUele,  in  place  of  the 
ancestral  C  allele  would  eliminate  the  potential  for  DNA  meth¬ 
ylation  at  this  site.  This  is  highly  speculative  but  could  be  tested 
further  by  methylation  specific  PGR  or  pyro  sequencing  of 
bisulfite  treated  DNA  at  this  particular  SNP. 

The  region  at  4p  1 5  may  be  a  distal  regulatory  region  in  cis  of  one 
or  more  of  the  neighboring  genes,  PCDH7,  ARAP2,  and  DTHDl, 
or  that  this  region  of  DNA  may  harbor  yet  unknown  functions  such 
as  noncoding  RNA  (ncRNA)  or  chromatin  regulatory  regions. 
Indeed,  the  peak  of  SNPs  likely  in  linkage  disequilibrium  with  the 
genome-wide  significant  SNP  on  chromosome  4  are  in,  or  very 
close  to,  the  ncRNA  BC036345  (chr4:  33,897,961-34,041,515)  of 
unknown  function.  However,  further  work  needs  to  be  performed 
to  identify  the  effect  of  the  ncRNA  within  this  region.  Related  to  its 
possible  roles  in  these  neighboring  genes,  of  note  PCDH7  (proto- 
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FIG.  2.  Replication  of  Genome-wide  PTSD-associated  SNP  and 
Evidence  for  an  meOTL.  Replication  results  for  associations  of 
genome-wide  significant  SNP,  rs717942,  with  PTSD  diagnosis  in 
the  (A)  discovery  cohort  (N  =  14?,  0R  =  8.G  Cl  =  3.5-21.6, 
P=3.84x  10  ^].  and  (8)  replication  cohort  (females  only, 

N  =  200G,  OR  =  1.25,  Cl  =  1.1-1.5,  P  =  0.005).  (C)  Of  the  11 
CpG  within  1  MB  of  the  genome-wide  SNP,  rs212942  affects 
expression  levels  (probe  cg09242288]  with  the  risk  genotype 
(TT)  showing  decreased  proportion  of  methylation  (N  =  152,  |3 
(SE)  =  -0.02  (0.002),  P=  0.002]. 


cadherin7)  is  expressed  witliin  the  brain  [Herteletal.,2012],andit 
has  recently  been  associated  with  sleep  regulation,  epilepsy,  and 
neuro cognitive  function  (e.g.,  OUila  et  al.,  2014).  ARAP2  is  also 
expressed  in  cortex  and  hippocampus,  and  appears  to  be  involved 
in  cellular  cytoskeletal  dynamics  and  endosome  regulation  (e.g., 
Chen  et  al.,  2013).  Less  is  known  about  DTHDl  functioning,  but  it 
has  been  associated  with  retinal  dystrophy  [Abu-Safieh  et  al.,  2013] . 
Thus  all  of  the  genes  surrounding  this  SNP  have  potentially 
interesting  characteristics  as  candidates  expressed  within  the  cen¬ 
tral  nervous  system  (CNS)  and  putatively  associated  with  other 
forms  of  CNS  pathology. 

To  further  examine  whether  there  were  any  associations 
between  rs717947  and  intermediate  phenotypes  associated 
with  PTSD,  we  examined  brain  activation  during  a  fear  and 
threat  processing  task — passive  viewing  of  fearful  faces,  which  we 
and  others  have  previously  associated  with  differential  responses 
to  PTSD  and  PTSD-related  genetic  pathways  [Andero  et  al.,  2013; 
Stevens  et  al.,  2013;  Stevens  et  al.,  2014].  We  found  that  when 
confronted  with  threat  stimuli,  individuals  carrying  the  rs717947 
risk  allele  showed  decreased  activation  among  prefrontal  regions 
that  regulate  negative  emotion  (dlPFC  and  dmPFC).  Dysregu- 
lation  of  negative  emotion,  whether  related  to  inability  to  extin¬ 
guish  fear,  hypervigilance  to  threatening  cues,  or  generalization 
of  fearful  stimuli,  are  all  important  intermediate  phenotypes 
related  to  PTSD.  These  data  suggest  that  SNPs  within  this  region 
of  chromosome  4  associated  with  PTSD  and  differential  genomic 
methylation,  are  also  associated  with  differential  intermediate 
fear-related  neural  phenotypes. 

There  are  several  limitations  of  this  study.  The  sample  size  of  the 
discovery  cohort  (SBPBC)  was  very  small;  however,  we  believe  that 
the  extreme  phenotype  design  and  well-controlled  nature  of  the 
study  enhance  its  power  to  determine  genetic  differences  between 
cases  and  controls.  While  the  extreme  phenotype  design  may  yield 
interesting  information,  traditional  modeling  of  the  phenotype  is 
challenging.  Follow-up  studies  using  all  continuous  values  of  this 
phenotype  are  essential  for  confirmation  of  rs7 17947  as  a  risk 
variant  for  PTSD.  We  have  used  alternative  PTSD  phenotypes,  not 
based  on  extreme  phenotype  thresholds,  to  assess  the  robustness 
of  the  association.  The  cohort  is  also  of  mixed  race/ethnicity. 
Although  we  used  principal  components  to  control  for  ancestry 
differences  in  the  samples,  it  is  possible  that  the  susceptibility  for 
disease  in  risk  allele  carriers  is  actually  driven  by  subtle  differences 
in  the  genetic  backgrounds  of  the  different  races.  Additionally,  the 
discovery  and  replication  cohorts  are  quite  different,  particularly  in 
sex  and  trauma  type.  Finally,  almost  half  of  the  cases  in  the 
discovery  cohort  had  comorbid  depression,  a  level  of  comorbidity 
common  across  PTSD  patients  and  seen  within  our  civilian 
cohort  as  well.  Further  research  is  necessary  to  determine  whether 
SNP  rs7 17947  is  specific  to  PTSD  or  to  comorbid  PTSD  and 
depression. 

In  summary,  we  have  identified  a  novel  genomic  locus  associ¬ 
ated  with  PTSD  symptoms  and  diagnosis  across  two  very  differ¬ 
ent  cohorts — one  military,  primarily  of  mixed  race  and  male,  and 
one  civilian,  African  American,  and  primarily  female.  The  obser¬ 
vation  will  need  further  replication  across  other  cohorts.  How¬ 
ever,  the  combination  of  significance  across  mixed  samples, 
association  with  differential  regional  DNA  methylation,  and 
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FIG.  3.  Oorsomedial  and  Dorsolateral  PFC  Differentially  Respond  to  Fearful  Faces  as  a  function  of  Genotype.  Functional  MRI  was  used  to 
examine  whether  the  PTSD-associated  SNP  was  related  to  differential  brain  regional  activation  with  fearful  cues.  (A)  Example  of  neutral  and 
fearful  face  stimuli  from  the  fearful  faces  task.  [B]  Axial  section  showing  medial  and  lateral  prefrontal  signal  associated  with  genotype. 
Significant  clusters  associated  with  genotype  are  shown  at  a  whole-brain  corrected  threshold  of  P<D.05,  N  =  53  (33  CC,  16  TC,  4  TT).  (C) 
Dorsolateral  and  [D]  Dorsomedial  regions  surviving  whole  brain  correction.  Images  are  shown  in  neurological  orientation,  on  template  slices  in 
Montreal  Neurological  Institute  (MNI)  space.  Correction  for  multiple  comparisons  used  a  combined  height-extent  threshold  calculated  using 
monte  carlo  simulation  in  Alphasim,  with  1,DDD  iterations  and  a  cluster-forming  threshold  of  P<D.01. 


association  with  neural  intermediate  phenotypes  related  to  PTSD 
are  all  compelling,  and  suggest  this  region  as  a  potentially 
important  genomic  region  for  further  study.  Further  genome 
wide  association  studies  for  PTSD,  such  as  those  currently 
planned  within  the  Psychiatric  Genomics  Consortium  PTSD 
workgroup  [Koenen  et  ah,  2013],  will  continue  to  dissect  the 
genomic  architecture  underlying  PTSD,  with  hope  for  new 
biomarkers  and  therapeutic  targets. 
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Additional  supporting  information  may  be  found  in  the  online 

version  of  this  article  at  the  publisher’s  web-site. 


